NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

AdaBB: Adaptive Barzilai-Borwein Method for Convex Optimization

https://doi.org/10.1287/moor.2024.0510

Zhou, Danqing; Ma, Shiqian; Yang, Junfeng (March 2025, Mathematics of Operations Research)

In this paper, we propose AdaBB, an adaptive gradient method based on the Barzilai-Borwein stepsize. The algorithm is line-search-free and parameter-free, and it essentially provides a convergent variant of the Barzilai-Borwein method for general convex optimization problems. We analyze the ergodic convergence of the objective function value and the convergence of the iterates for solving general convex optimization problems. Compared with existing works along this line of research, our algorithm gives the best lower bounds on the stepsize and the average of the stepsizes. Furthermore, we present extensions of the proposed algorithm for solving locally strongly convex and composite convex optimization problems where the objective function is the sum of a smooth function and a nonsmooth function. In the case of local strong convexity, we achieve linear convergence. Our numerical results also demonstrate very promising potential of the proposed algorithms on some representative examples. Funding: S. Ma is supported by the National Science Foundation [Grants DMS-2243650, CCF-2308597, CCF-2311275, and ECCS-2326591] and a startup fund from Rice University. J. Yang is supported by the National Natural Science Foundation of China [Grants 12431011 and 12371301] and the Natural Science Foundation for Distinguished Young Scholars of Gansu Province [Grant 22JR5RA223].
more » « less
Full Text Available
SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning

Ding, Yangruibo; Peng, Jinjun; Min, Marcus; Kaiser, Gail; Yang, Junfeng; Ray, Baishakhi (December 2024, Advances in Neural Information Processing Systems, NeurIPS 2024)

Full Text Available
kGym: A Platform and Dataset to Benchmark Large Language Models on Linux Kernel Crash Resolution

Mathai, Alex; Huang, Chenxi; Maniatis, Petros; Nogikh, Aleksandr; Ivancic, Franjo; Yang, Junfeng; Ray, Baishakhi (December 2024, Conference on Neural Information Processing Systems (NeurIPS))

Full Text Available
SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning

Ding, Yangruibo; Peng, Jinjun; Min, Marcus J; Kaiser, Gail; Yang, Junfeng; Ray, Baishakhi (September 2024, OpenReview.net)

Code Large Language Models (Code LLMs) have excelled at tasks like code completion but often miss deeper semantics such as execution effects and dynamic states. This paper aims to bridge the gap between Code LLMs' reliance on static text data and the need for semantic understanding for complex tasks like debugging and program repair. We introduce a novel strategy, monologue reasoning, to train Code LLMs to reason comprehensive semantics, encompassing high-level functional descriptions, local execution effects of individual statements, and overall input/output behavior, thereby linking static code text with dynamic execution states. We begin by collecting PyX, a clean Python corpus of fully executable code samples with functional descriptions and test cases. We propose training Code LLMs not only to write code but also to understand code semantics by reasoning about key properties, constraints, and execution behaviors using natural language, mimicking human verbal debugging, i.e., rubber-duck debugging. This approach led to the development of SemCoder, a Code LLM with only 6.7B parameters, which shows competitive performance with GPT-3.5-turbo on code generation and execution reasoning tasks. SemCoder achieves 79.3% on HumanEval (GPT-3.5-turbo: 76.8%), 63.6% on CRUXEval-I (GPT-3.5-turbo: 50.3%), and 63.9% on CRUXEval-O (GPT-3.5-turbo: 59.0%). We also study the effectiveness of SemCoder's monologue-style execution reasoning compared to concrete scratchpad reasoning, showing that our approach integrates semantics from multiple dimensions more smoothly. Finally, we demonstrate the potential of applying learned semantics to improve Code LLMs' debugging and self-refining capabilities. Our data, code, and models are available at: https://github.com/ARiSE-Lab/SemCoder.
more » « less
Full Text Available
RogueOne: Detecting Rogue Updates via Differential Data-flow Analysis Using Trust Domains

https://doi.org/10.1145/3597503.3639199

Sofaer, Raphael J; David, Yaniv; Kang, Mingqing; Yu, Jianjia; Cao, Yinzhi; Yang, Junfeng; Nieh, Jason (April 2024, ACM)

Full Text Available
Crowdsourcing-based Model Testing in Federated Learning

https://doi.org/10.1109/TrustCom60117.2023.00048

Yi, Yunpeng; Lv, Hongtao; Luo, Tie; Yang, Junfeng; Liu, Lei; Cui, Lizhen (November 2023, IEEE)

Full Text Available
Effective Performance Issue Diagnosis with Value-Assisted Cost Profiling

https://doi.org/10.1145/3552326.3587444

Weng, Lingmei; Hu, Yigong; Huang, Peng; Nieh, Jason; Yang, Junfeng (May 2023, Proceedings of the 18th European Conference on Computer Systems)

Full Text Available
Learning Approximate Execution Semantics From Traces for Binary Function Similarity

https://doi.org/10.1109/TSE.2022.3231621

Pei, Kexin; Xuan, Zhou; Yang, Junfeng; Jana, Suman; Ray, Baishakhi (April 2023, IEEE Transactions on Software Engineering)

Full Text Available
BPF-oF: Storage Function Pushdown Over the Network

Zarkadas, Ioannis; Zussman, Tal; Carin, Jeremy; Jiang, Sheng; Zhong, Yuhong; Pfefferle, Jonas; Franke, Hubertus; Yang, Junfeng; Kaffes, Kostis; Stutsman, Ryan; et al (September 2023, Arxiv)

Full Text Available
Golden Ratio Primal-Dual Algorithm with Linesearch

https://doi.org/10.1137/21M1420319

Chang, Xiao-Kai; Yang, Junfeng; Zhang, Hongchao (September 2022, SIAM Journal on Optimization)

Full Text Available

« Prev Next »

Search for: All records